NSF PAR Search | NSF Public Access Repository

Behavioral Analysis of Information Salience in Large Language Models

https://doi.org/10.18653/v1/2025.findings-acl.1204

Trienes, Jan; Schlötterer, Jörg; Li, Junyi Jessy; Seifert, Christin (July 2025, Findings of the Association for Computational Linguistics: ACL 2025)

Large Language Models (LLMs) excel at text summarization, a task that requires models to select content based on its importance. However, the exact notion of salience that LLMs have internalized remains unclear. To bridge this gap, we introduce an explainable framework to systematically derive and investigate information salience in LLMs through their summarization behavior. Using length-controlled summarization as a behavioral probe into the content selection process, and tracing the answerability of Questions Under Discussion throughout, we derive a proxy for how models prioritize information. Our experiments on 13 models across four datasets reveal that LLMs have a nuanced, hierarchical notion of salience, generally consistent across model families and sizes. While models show highly consistent behavior and hence salience patterns, this notion of salience cannot be accessed through introspection, and only weakly correlates with human perceptions of information salience.
Free, publicly-accessible full text available July 1, 2026
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

Trienes, Jan; Joseph, Sebastian; Schlötterer, Jörg; Seifert, Christin; Lo, Kyle; Xu, Wei; Wallace, Byron C; Li, Jessy (August 2024, Association for Computational Linguistics)

Full Text Available
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

https://doi.org/10.18653/v1/2024.acl-long.234

Trienes, Jan; Joseph, Sebastian; Schlötterer, Jörg; Seifert, Christin; Lo, Kyle; Xu, Wei; Wallace, Byron; Li, Junyi Jessy (January 2024, Association for Computational Linguistics)

Full Text Available
Report on the Dagstuhl Seminar on Frontiers of Information Access Experimentation for Research and Education

https://doi.org/10.1145/3636341.3636351

Bauer, Christine; Carterette, Ben; Ferro, Nicola; Fuhr, Norbert; Beel, Joeran; Breuer, Timo; Clarke, Charles L A; Crescenzi, Anita; Demartini, Gianluca; Di Nunzio, Giorgio Maria; et al. (June 2023, ACM SIGIR Forum)

This report documents the program and outcomes of Dagstuhl Seminar 23031, Frontiers of Information Access Experimentation for Research and Education, which brought together 38 participants from 12 countries. The seminar addressed technology-enhanced information access (information retrieval, recommender systems, natural language processing) and focused specifically on developing more responsible experimental practices that lead to more valid results, both for research and for scientific education. The seminar featured a series of long and short talks delivered by participants, which helped establish common ground and surface topics of interest to explore as the seminar's main output. This led to the definition of five groups, which investigated challenges, opportunities, and next steps in the following areas: reality check, i.e., conducting real-world studies; human-machine-collaborative relevance judgment frameworks; overcoming methodological challenges in information retrieval and recommender systems through awareness and education; results-blind reviewing; and guidance for authors. Date: 15-20 January 2023. Website: https://www.dagstuhl.de/23031.
Full Text Available
